Parallel Knowledge Discovery Using Domain Generalization Graphs
Abstract
Multi-Attribute Generalization is an algorithm for attribute-oriented induction in relational databases using domain generalization graphs. Each node in a domain generalization graph represents a different way of summarizing the domain values associated with an attribute. When generalizing a set of attributes, we show how a serial implementation of the algorithm generates all possible combinations of nodes from the domain generalization graphs associated with the attributes, resulting in the presentation of all possible generalized relations for the set. We then show how the inherent parallelism in domain generalization graphs is exploited by a parallel implementation of the algorithm. Significant speedups were obtained using our approach when large discovery tasks were partitioned across multiple processors. The results of our work enable a database analyst to quickly and efficiently analyze the contents of a relational database from many different perspectives.
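The enumeration step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the attribute names, the node names, and the round-robin partitioning scheme are all hypothetical, chosen only to show how picking one node per domain generalization graph yields a Cartesian product of generalization tasks that can be split across processors.

```python
from itertools import product

# Hypothetical domain generalization graphs, reduced to their node names.
# Each node stands for one way of summarizing that attribute's domain.
dgg_nodes = {
    "city": ["city", "province", "country", "ANY"],
    "date": ["day", "month", "year", "ANY"],
    "item": ["item", "category", "ANY"],
}


def all_node_combinations(nodes_by_attr):
    """Return every combination that picks one node per attribute."""
    attrs = sorted(nodes_by_attr)
    return [
        dict(zip(attrs, combo))
        for combo in product(*(nodes_by_attr[a] for a in attrs))
    ]


def partition(tasks, n_workers):
    """Deal tasks round-robin into n_workers lists (an assumed scheme)."""
    return [tasks[i::n_workers] for i in range(n_workers)]


# Serial view: one generalized relation would be produced per combination.
combos = all_node_combinations(dgg_nodes)

# Parallel view: each chunk could be handed to a separate processor.
chunks = partition(combos, 4)
```

With these made-up graphs there are 4 × 4 × 3 = 48 combinations, so each of the four chunks holds 12 independent tasks. The combinations do not depend on one another, which is what makes the work easy to partition.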
Similar resources
Accounting for Domain Knowledge in the Construction of a Generalization Space
Our study falls within the framework of the automatic construction of classifications. We tackle an issue that has been less explored: the discovery of classifications. To tackle this problem, we have chosen to pursue and develop the work of Mineau in the domain of the organization of knowledge bases using generalization [20]. We propose an original approach, called COING, to the discove...
Data Mining in Large Databases Using Domain Generalization
Attribute-oriented generalization summarizes the information in a relational database by repeatedly replacing specific attribute values with more general concepts according to user-defined concept hierarchies. We introduce domain generalization graphs for controlling the generalization of a set of attributes and show how they are constructed. We then present serial and parallel versions of the Mu...
Designing an Ontology for Knowledge Discovery in Iran’s Vaccine
Ontology is a requirement engineering product and the key to knowledge discovery. It includes the terminology to describe a set of facts, assumptions, and relations with which the detailed meanings of vocabularies among communities can be determined. This is a qualitative content analysis research. This study has made use of ontology for the first time to discover the knowledge of vaccine in Ir...
A generalization of zero-divisor graphs
In this paper, we introduce a family of graphs which is a generalization of zero-divisor graphs and compute an upper bound for the diameter of such graphs. We also investigate their cycles and cores.
Expert Discovery: A web mining approach
Expert discovery is the search for an answer to the question: “Who is the best expert on a specific subject in a particular domain within a peculiar array of parameters?” Experts with domain knowledge in any field are crucial for consulting in industry, academia, and the scientific community. The aim of this study is to address the issues of the expert-finding task in a real-world community. Collabor...